Classification on High Dimensional Metabolic Data: Phenylketonuria as an Example
نویسندگان
چکیده
Tandem mass spectrometry is a promising new screening technology which permits screening within one analytical run not only for phenylketonuria (PKU) but also for a wide range of other metabolic disorders in newborns. We investigated two symbolic supervised machine learning techniques logistic regression analysis (LRA) and decision trees (DT), where the knowledge is represented in an explicit way to find classification rules for the presence of PKU. Our experiments were performed on pre-classified newborn screening data including a metabolite spectrum of 14 amino acids. LRA and DT classifiers showed high classification performance with a sensitivity of ≥ 97.7% and a specificity of ≥ 99.8%. In addition to the established diagnostic metabolites of phenylalanine and tyrosine, we also included alternative constellations of metabolites in our models showing comparable results in predictive power. The presented machine learning techniques are appropriate to investigate metabolic patterns in newborn screening data for constructing classification models for PKU.
منابع مشابه
Classification of Chronic Kidney Disease Patients via k-important Neighbors in High Dimensional Metabolomics Dataset
Background: Chronic kidney disease (CKD), characterized by progressive loss of renal function, is becoming a growing problem in the general population. New analytical technologies such as “omics”-based approaches, including metabolomics, provide a useful platform for biomarker discovery and improvement of CKD management. In metabolomics studies, not only prediction accuracy is ...
متن کاملAdult issues in phenylketonuria.
Phenylketonuria (PKU) is a classical example of an inherited metabolic disease, in which mental retardation can be prevented successfully by using a diet. However, in adult PKU new problems occur, such as vitamin deficiencies, osteoporosis and the maternal PKU syndrome. The aim of this review article is to provide guidelines for the clinician to understand and manage PKU in adults.
متن کاملSFLA Based Gene Selection Approach for Improving Cancer Classification Accuracy
In this paper, we propose a new gene selection algorithm based on Shuffled Frog Leaping Algorithm that is called SFLA-FS. The proposed algorithm is used for improving cancer classification accuracy. Most of the biological datasets such as cancer datasets have a large number of genes and few samples. However, most of these genes are not usable in some tasks for example in cancer classification....
متن کاملDesign and Development of a Minimum Data Set for Phenylketonuria Disease
Introduction: Phenylketonuria is one of the most common autosomal recessive metabolic diseases, characterized by a wide range of neuropsychological and neurocognitive disorders. Without proper care, control, and management, this disease can lead to severe mental retardation and neurobehavioral disorders. Therefore, the objective of this study was to design and develop a Minimum Data Set (MDS) f...
متن کاملHyperspectral Image Classification Based on the Fusion of the Features Generated by Sparse Representation Methods, Linear and Non-linear Transformations
The ability of recording the high resolution spectral signature of earth surface would be the most important feature of hyperspectral sensors. On the other hand, classification of hyperspectral imagery is known as one of the methods to extracting information from these remote sensing data sources. Despite the high potential of hyperspectral images in the information content point of view, there...
متن کامل